Weighted pairwise scatter to improve linear discriminant analysis
Abstract
Linear Discriminant Analysis (LDA) aims to transform an original feature space to a lower dimensional space with as little loss in discrimination as possible. We introduce a novel LDA matrix computation that incorporates confusability information between classes into the transform, with the goal of improving discrimination in LDA. Conventional LDA uses a between-class covariance matrix based on the scatter of class means around the global mean. By rewriting the between-class covariance expression as a sum over class pairs, we show that conventional LDA treats every class pair as equally confusable. We introduce a weighting factor for each pairwise scatter, which makes it possible to integrate confusability information into the between-class covariance matrix. The weighting factors can be chosen in many ways. We consider a few that depend on Euclidean and Kullback-Leibler distances between classes when each class is approximated by a single Gaussian. Combined with a speaker-cluster-based transformation, the method decreases the error rate by about 10% relative on a large vocabulary speech recognition task using IBM's speech recognition engine.
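The pairwise rewriting described in the abstract can be illustrated with a minimal NumPy sketch. It relies on the standard identity that, for class priors summing to one, the scatter of class means around the global mean equals the sum over class pairs of p_i p_j (mu_i - mu_j)(mu_i - mu_j)^T, so the conventional matrix corresponds to a weight of one on every pair. The function names and the inverse squared Euclidean weighting are assumptions made for illustration only; the abstract does not give the exact weighting formulas used in the paper.

```python
import numpy as np

def conventional_between_scatter(means, priors):
    """Scatter of class means around the prior-weighted global mean."""
    means = np.asarray(means, dtype=float)    # shape (C, d)
    priors = np.asarray(priors, dtype=float)  # shape (C,), sums to 1
    centered = means - priors @ means         # subtract the global mean
    return (centered * priors[:, None]).T @ centered

def pairwise_between_scatter(means, priors, weights=None):
    """Between-class scatter written as a sum over class pairs.

    weights is an optional symmetric (C, C) array; None means every
    pair gets weight 1, which reproduces the conventional matrix.
    """
    means = np.asarray(means, dtype=float)
    priors = np.asarray(priors, dtype=float)
    n_classes, dim = means.shape
    if weights is None:
        weights = np.ones((n_classes, n_classes))
    scatter = np.zeros((dim, dim))
    for i in range(n_classes):
        for j in range(i + 1, n_classes):
            diff = means[i] - means[j]
            scatter += weights[i, j] * priors[i] * priors[j] * np.outer(diff, diff)
    return scatter

def inverse_sq_euclidean_weights(means, eps=1e-12):
    """Hypothetical weighting (an assumption, not the paper's formula):
    class pairs whose means are closer get larger weights."""
    means = np.asarray(means, dtype=float)
    diff = means[:, None, :] - means[None, :, :]
    sq_dist = np.sum(diff ** 2, axis=-1)
    weights = 1.0 / (sq_dist + eps)
    np.fill_diagonal(weights, 0.0)  # a class is not paired with itself
    return weights

# Toy example: classes 0 and 1 are close together, class 2 is far away.
means = np.array([[0.0, 0.0], [1.0, 0.0], [5.0, 5.0]])
priors = np.array([0.5, 0.3, 0.2])
weights = inverse_sq_euclidean_weights(means)
weighted_scatter = pairwise_between_scatter(means, priors, weights)
```

With unit weights the pairwise form matches the conventional matrix, while the example weighting emphasises the close pair (classes 0 and 1), which is the kind of confusable pair the abstract aims to separate better.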
Similar resources
To Weight or Not to Weight: Source-Normalised LDA for Speaker Recognition Using i-vectors
Source-normalised Linear Discriminant Analysis (SNLDA) was recently introduced to improve speaker recognition using i-vectors extracted from multiple speech sources. SNLDA normalises for the effect of speech source in the calculation of the between-speaker covariance matrix. Source-normalised-and-weighted (SNAW) LDA computes a weighted average of source-normalised covariance matrices to better e...
Locally Weighted Linear Discriminant Analysis for Robust Speaker Verification
Channel compensation is an integral part of any state-of-the-art speaker recognition system. Typically, Linear Discriminant Analysis (LDA) is used to suppress directions containing channel information. LDA assumes a unimodal Gaussian distribution of the speaker samples to maximize the ratio of the between-speaker variance to within-speaker variance. However, when speaker samples have multi-moda...
Linear dimensionality reduction using relevance weighted LDA
The linear discriminant analysis (LDA) is one of the most traditional linear dimensionality reduction methods. This paper incorporates the inter-class relationships as relevance weights into the estimation of the overall within-class scatter matrix in order to improve the performance of the basic LDA method and some of its improved variants. We demonstrate that in some specific situations the s...
Pairwise-Covariance Linear Discriminant Analysis
In machine learning, linear discriminant analysis (LDA) is a popular dimension reduction method. In this paper, we first provide a new view of LDA from an information-theoretic perspective. From this perspective, we propose a new formulation of LDA, which uses the pairwise averaged class covariance instead of the globally averaged class covariance used in standard LDA. This pairwise (av...
Face recognition using nonparametric-weighted Fisherfaces
This study presents an appearance-based face recognition scheme called the nonparametric-weighted Fisherfaces (NW-Fisherfaces). Pixels in a facial image are considered as coordinates in a high-dimensional space and are transformed into a face subspace for analysis by using nonparametric-weighted feature extraction (NWFE). According to previous studies of hyperspectral image classification, NWFE...